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(57) Abstract 

An advanced imaging 
spectrograph system (10) includes an 
electrophoresis device (12). an optical 
imaging device (14), and a processor 
(16), configured to provide for slab-gel 
DNA sequencing and genotyping with 
high throughput sequencing. The 
system is based on the integration of 
improved electrophoresis structures 
with an imaging spectrophotometer that 
records the entire emission spectra along 
an imaging line across a sequencing gel 
(or capillary array). In one embodiment, 
the electrophoresis device (12) includes 
a large number of parallel lanes (18) 
formed in an electrophoresis gel (20) 
which is sandwiched between two 
flat rectangular glass plates (22). The 
system includes spectral shape matching 
to improve dye identification allowing 
the use of dyes having nearly any 
emission spectra and allowing greater 
than four dye muldplexing. 
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AUTOMATIC SEQUENCER/GENOTYPER HAVING EXTENDED 
SPECTRAL RESPONSE 

5 pAPynROUN n OF THE INVENTION 

The present invention relates to DNA sequencing and genotyping. and more 
particularly, to DNA sequencers and genotypers that use optical fluorescence detection techniques. 

The basic biological characteristics of a living organism are contained in its genes 
or genetic code. In humans, for example, a person's biological characteristics are controlled by the 
10 genetic code contained in 23 chromosome pairs. Each chromosome contains differing genes. 

The specific details of a genetic code are contained in long double helical 
molecules called deoxyribonucleic acid or DNA. The DNA consists of long sequence pairs of four 
nucleotides or bases: adenosine, cytosine. guanosine or thymidine, commonly referred to by the 
letters A, C, G. and T. respectively. In the double helix, the A and T nucleotides are complementary 
15 and the C and G nucleotides are complementary. Thus* the DNA molecules consist of two 
complementary strands that are bound together by the complements. 

It is often advantageous to know the sequence of the DNA nucleotides associated 
with a particular gene. For example, genetic defects can be detected by analyzing an organism's 
genes. The DNA nucleotides for several bacteria and viruses have been sequenced, and currently, 
20 sequencing of the entire human genome is in progress. The entire human DNA consists of 
approximately 3 billion nucleotides or base pairs. 

Existing high speed DNA sequencers use electrophoresis gel techniques, in 
conjunction with fractioning enzymes and fluorescent tags or markers, to separate residual DNA 
sequence fragments as they travel through a gel. More specifically, each DNA fragment has an 
25 incrementally different molecular weight and size. Because the mobility is related to the fragment's 
weight, structure, and charge, the differing fragments travel through the gel at a differing speeds. 
Thus, the time it takes a fragment to travel through the gel relates to the fragment's mobility. 

Generally, four fluorescent tags are used. These tags bind on the residual 
fragmente in accordance with the exposed end base, if using dye terminator chemistry, or are 
30 attached to primers that are used to initiate the sequencing reaction, if using dye primer chennistry. 
The sequence is read by causing the fluorescent markers to fluoresce. The four fluorescent tags 
generally are selected to have a strong fluorescence peak that is separated from the strong 
fluorescence peak of the remaining tags. An optical instrument detects the emitted fluorescence 
signals. 

35 Existing DNA Sequencers use an optical filter having a pass-band that is centered 

about the appropriate wavelength to distinguish between the dyes, and thus the fragments. The 
optical instrument typically includes a simple spectrometer or a filter wheel and a photomultiplier. 
The filter wheel has several colored filters, each filter passing light within a wavelength band 
corresponding to the spectral peak of one of the tags. A simple spectrograph has a wavelength- 
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depeodent light disperser such as a prism. The light disperser spreads, generally aiong a line, the 
different wavelengths of fluorescent light from the DNA fragments traveling in the gel. Four 
detectors are placed along the spreading line of the spectrograph at differing locations that correspond 
to the wavelengths associated with the fluorescent tags. 
5 Fluorescent dyes have been found to be good fluorescent tags. Using dye primer 

chemistry, the tag associated with the C base often is fluorescein-5-isothiocynate (FITC) which has 
an emission or fluorescence peak at about 525 nanometers. The tag associated with the T base often 
is Texas Red, which has a fluorescence peak at about 620 nanometers. The tag associated with the G 
base often is Tetramethyl rhodamine isothiocynate (TRITC), which has a fluorescence peak at about 
10 580 nanometers. Finally* the marker associated with the A base often is 4<fluoro-7nitro- 
benzofurazan (NBD-fluoride) which has a fluorescence peak at about 540 nanometers. 
Commercially, four universal primers, respectively labeled with dyes called FAM (C), TAMRA (G), 
JOE (A), and ROX (T), are available from Applied Biosystems, Inc. (ABI) of Foster City, 
California. 

15 The fluorescent dyes indicated above are subject to bleaching which limits the 

excitation light's power level and thus the intensity of the emitted fluorescence signal from the dyes. 
The upper limits on fluorescence intensity limit the signal-to-noise ratio (SNR) and eventually the 
system's throughput. 

Accordingly, there exists a need for a sequencing or genotyping system that has 

20 increased throughput and sensitivity over systems using four dyes that are distinguished by their 
respective fluoresce peaks. The present invention satisfles these needs. 



SUMMARY OF THE INVENTION 

A sequencing or genotyping system includes an imaging spectrograph that records 

25 the entire emission spectra across a plurality of lanes in an electrophoresis sequencing gel. The 

system includes spectral shape matching to improve dye identiflcation, thereby allowing the use of 
dyes having nearly any emission spectra and allowing greater than four dye multiplexing. 

In a flrst embodiment of the invention, the system includes a plurality of 
electrophoresis lanes. Each lane has a respective first and second end, and an electrophoresis medium 

30 between the flrst and second ends. Each lane is loaded with fluorescent! y -tagged charged molecules 
having differing mobilities and chemical properties. An electrical potential, of appropriate polarity, 
is applied between the first and second ends and causes charged molecules applied at the first end to 
travel toward the second end at a rate proportional to each molecule's mobility such that the charged 
molecules are separated along the lane based on the molecule's mobility. A read zone extends 

35 substantially along an image line and intersects the plurality of electrophoresis lanes near the second 
ends. The system also includes a light source and an imaging spectrometer. The light source 
illuminates the read zone with exciution light to cause the charged molecules to fluoresce and 
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produce fluorescent light. The imaging spectrometer spectrally images the Huorescent light onto a 
two-dimensional imaging plane. The first dimension of the imaging plane is associated with a 
distance along the image line of the read zone and the second dimension is associated with the 
wavelength of the fluorescent light. The imaging spectrometer simultaneously images the fluorescent 
5 light onto the two- dimensional imaging plane without any scanning motions or delays. 

The system may further include a camera having a two dimensional pixel array. 
The camera generates video signals based on the intensity of light incident upon the pixel array. A 
display may display a graph of the chemical properties of the molecules crossing the imaging line 
verses time. 

jQ In more detailed features of the invention, the imaging spectrograph further 

comprises a linear entrance aperture with discrete locations along the aperture corresponding to 
locations along the first dimension of the image plane. Further, a plurality of optical fibers couple 
the fluorescent light from the read zone to corresponding locations along the entrance aperture. 
Alternatively, the imaging spectrograph may include an optical lens system that directly images the 
1 5 read zone. Further, the system may include a processor that compares the detected fluorescent light 
received from molecules of a given mobility with reference spectral profiles for the fluorescent tags 
to identify the associated fluorescent tag and thus the molecule's associated chemical property. 

In other more detailed features of the invention, associated with genotyping or 
DNA fingerprinting, the charged molecules are at least 10 differing genetic markers of a. genome and 
20 a fluorescent tag. having a unique spectral profile, is associated with each genetic marker. The 

processor compares the detected fluorescent light with the reference spectral profiles by calculating 
weighting factors, for each reference spectral profile, based on a deconvolution of the detected 
fluorescent light with the reference spectral profile for each of the fluorescent tags. Further, at least 
one fluorescent dye may be attached to charged molecules of a known mobility, to provide a mobility 
25 calibration reference. The processor calculates the mobility of any charged molecules of unknown 
mobility, based on the unknown molecule's travel rate and the mobility calibration reference. 
Additionally, the fluorescent tags used in adjacent lanes may differ from each other. The processor 
tracks lane drift along the image line using the difference between the fluorescent tags in adjacent 
lanes. 

30 In yet other more detailed features of the invention, the charged molecules are 

DNA fragments and each fragment is tagged with an atuched fluorescent dye that identifies the 
fragment's end base. The fluorescent tags are four primer dyes: FTTC, TRITC. NBD-fluoride, and 
Texas Red, corresponding to the bases C, G. A, and T, respectively. The light source generates 
excitation light at wavelengths of about 488 nanometera and 514 nanometers. 

35 The lanes are on a planar substrate and the excitation light is a laser beam that 

travels along the image Une to simultaneously illuminate the lanes. A mirror reflects the laser beam 
back through the read zone along the image line to increase the intensity and uniformity of the 
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fiuorescent light. A lens couples the laser beam from a laser to the gel to increase the intensity of the 
exciUtion light traveling along the image line. 

Further, optical fibers may couple the fluorescent light from the image line to an 
entrance slit on the imaging spectrometer. A cylindrical lens focuses the fluorescent light from the 
5 imaging line onto ends of the optical fibers and a mirror is located behind the read zone such that 
fluorescent light travelling away from the fiber end is reflected back toward the optical fiber ends. 

In yet other more detoiled features of the invention, the second dimension of the 
imaging plane corresponds to the spectral range from about 450 nanometers to about 900 nanometers 
and the wiaveiength within the spectral range of the second dimension so that resolution along the 
10 second dimension is less than four nanometers. Also, the excitation light is narrowband light having 
a wavelength within the spectral range of the second dimension so that excitation light scattered by 
the electrophoresis gel is imaged on a pixel array on the imaging plane. The electrical signals, 
generated by the pixel array based on the intensity of light collected by pixels corresponding to the 
wavelength of the excitation light, provide a monitor of the excitation light's intensity. 

The system may further include first and second electrodes for applying the 
electrical potential to the lanes at the first and second ends, respectively. The first and second 
electrodes extend across all of the plurality of lanes. Also, a plurality of loading electrodes may be 
situated in each lane near the first electrode for loading the charged molecules into the lanes. 

The electrophoresis medium may be a gel having a thickness of 200 microns which 
20 is sandwiched between two glass plates. An index matching buffer may optically couple the light 
from the light source to electrophoresis medium. The read zone may fiirther include fluorescent 
markers at each end of the image line for indicating the read zone. Also, the plurality of 
electrophoresis lanes may include 384 separate parallel lanes. A heater may be thermally coupled to 
the electrophoresis medium for maintaining the medium within a predetermined temperature range. 
25 The present invention is further embodied in a niethbd for sequencing DNA. First, 

DNA fragments that are tagged with fluorescent dyes are produced. The dyes indicate an end base 
associated with the respective DNA fragment. Next, the DNA fragments are separated according lo 
mobility using electrophoresis of the fragments on a plurality of electrophoresis lanes. The separated 
DNA fragments form fragment groups of slightly different mobility. Next, the fragment groups are 
30 excited with excitation light causing the fluorescent lags to fluoresce. Next, a hyperspectral image is 
formed of the separated DNA fragments. The hyperspectral image simultaneously covers all of the 
electrophoresis lanes and a broad spectral range. Next, the fluorescent dye associated with DNA 
fragments of particular molecular weight is identified by fitting the spectra emitted by a fragment 
group with reference spectra associated with the fluorescent dyes. 

In another method of identifying molecules tagged with fluorescent dyes, in 
accordance with the present invention, a hyperspectral profile is formed of a molecules' fluorescence 
emission. The hyperspectral profile covers a wavelength range from 450 nanometers to 900 
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nanometers with a resolution of less than 5 nanometers. Next, the hyperspectral profile is convoluted 
with a reference spectral profile of each dye over the entire spectral range to generate a weighting 
factor for each dye. Finally, the contribution of a particular dye to the molecules' fluorescence 
emission is indicated based on the weighting factor. 
5 The present invention is additionally embodied in a method for loading charged 

molecules of differing mobilities into a plurality of closely spaced electrophoresis lanes. Each lane 
has first electrophoresis electrodes located at a first end of the lane, a second electrophoresis 
electrodes located at a second end of the lane, and a loading electrode located near the lane's first 
end. First, a solution having the charged molecules is applied over the loading electrode of the first 
10 lane. Next, a first voltage is applied to the first and second electrophoresis electrodes of the first 
lane. The first voltage has the same polarity as the charged molecules. The charged molecules are 
then loaded into the first lane by applying a large second voltage, having a polarity opposite the 
polarity of the charged molecules, to the loading electrode of the first lane. A third voluge is 
applied to the loading electrode of previously loaded lanes while the loading process is repeated for 
15 the remaining electrophoresis lanes. The third voltoge is a reduced value of the first voltage. 

After loading the lanes, a fourth voltage is applied between the first electrophoresis electrode and the 
second electrophoresis electrode to cause the charged molecules to migrate through the 
electrophoresis lane at a rate proportional to the molecules mobility. 

In this method, the charged molecules are negatively charged DNA fragments. 
20 Further, the first voltoge is between about -1 and -2 volts, the second voltage is between about + 1 
' and +2 volts, the third volUge is between about -0. 1 and -0.2 volts, and 
the fourth voltage is between about -2000 and -4000 volts. 

Finally, the present invention is further embodied in a method of identifying 
fluorescently-togged DNA base fragment groups. First, the fragment groups are excited with 
25 excitotion light to cause a fluorescrait Ug associated with a respective fragment group to fluoresce and 
produce fluorescrat light. Next, the fiuoiescent light emitted by each fragment group is detected 
over a sufficiently large wavelength range to produce a spectral profile of the fluorescent light from 
each fragment group. Next, the detected spectral profile from each fragment group is compared with 
reference spectral profiles associated with the fluorescent tags to generate a weighting factor. 
30 Finally, the fluorescent tags associated with each fragment group are identified based on the fragment 
group's weighting factor. In a more deUiled feature of the method, the wavelength range extends 
from about 450 nanometers to about 900 nanometers. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The above and other aspects, features and advantages of the present invention will 
be more apparent from the following more particular description thereof, presented in conjunction 
with the following drawings, wherein: 
5 FIG. 1 is a block diagram of the imaging spectrograph sequencer/^enotyper 

embodying the present invention. 

FIG. 2 is a graph comparing the output intensity as a function of time for one 
fluorescent dye attached to DNA molecules moving past the detection system of the imaging 
sequencer/genotype of the present invention, with the output intensity of an existing system. 
10 FIG. 3A is a graph of the emission spectra of four common fluorophores used as 

tags in existing systems. 

FIG. 3B is a graph of the emission spectra of the fluorophore NBD, indicating the 
additional light intensity that fails to be collected by a passband filter of the prior art and that is 
collected by the imaging spectrograph sequencer of the present invention. 
15 FIG. 4 is a perspective view of another embodiment of the imaging portion of an 

imaging spectrograph sequencer of the present invention. 

FIG. 5 is a perspective view of electrophoresis lanes in an electrophoresis gel» in 
accordance with the present invention, having electrostatic loading pads. 

FIG. 6 is a cross-section of the electrophoresis lanes of the apparatus of FIG. I, 
20 showing the orientation of the electrophoresis lanes* excitation Iight» detection optical fibers, and 
reflective mirrors, in accordance with the present invention. 

FIG. 7 is a perspective view of the successive frames, forming an image cube, 
indicating the relationship between the lanes, the wavelengths, and the time or mobilities. 

FIG. 8 A is an image of a lane trace showing raw data obtained using the imaging 
25 spectrograph of FIG. 1. 

FIG . 8B is an image of a line trace showing background data corresponding to the 
imaging spectrograph and the electrophoresis gel of FIG. 1 . 

FIG. 8C is an image of an lane trace showing the net result of the combination of 
the image of FIG. 8A minus the image of FIG. 88, yielding the emitted fluorescence signal from the 
30 lane trace. 

FIG. 9A is a single image from the image cube of Fig. 7, showing the fluorescence 
signal received from six lanes having differing dye tags and the corresponding emission spectra of the 
tagged DNA. 

FIG. 9B is a graph for the integrated wavelength values, over a spectral range of 
35 FIG. 9 A, verses lane. 

FIG. 10 is a graph showing the time*dependant emission of four dyes attached to 
DNA, as detected by the imaging spectrograph. 
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FIG. 1 1 is a graph showing the time-dependent emission of four dyes, attached to 
DN A and the corresponding bases derived from the four peaks. 

FIG. 12 is a flow diagram of the data acquisition and image processing methods of 

the present invention. 

5 Corre^nding reference characters indicate corresponding components throughout 

the several views of the drawings. 

ppT^jT pp T^PgrRTPTIQN THP TNVFNTION 

The following description is of the best mode presently contemplated for carrying 
1 0 out the invention. This description is not to be taken in a limiting sense, but is made merely for the 
purpose of describing the general principles of the invention. The scope of the invention should be 
determined with reference to the claims. 

As shown in exemplary drawings, and particularly in FIGS. 1, 5 and 6. the present 
invention is embodied in a sequencing or genotyping system 10 that records the entire emission 
15 spectra of fluorescent dyes atuched to nucleoprotein fragments. The entire spectra is then compared 
against reference spectra, corresponding to the fluorescent dyes, to identify the particular fluorescent 

dye associated with a fragment group. 

The system 10 includes an electrophoresis device 12, an optical imaging device 14, 
and a processor 16. The electrophoresis device (FIGS. 1 and 5) includes 384 lanes 18 formed in an 

20 electrophoresis gel 20. The electrophoresis gel 20 has a thickness of 200 microns and is sandwiched 
between two flat rectangular glass plates 22. The glass plates have a width of about 25 centimeters 
(cm) and a length of about 60 cm. The lanes 18. having a total width of aboul 23 cm, nm parallel 
with the length of the glass plates 22 between a firat or loading buffer chamber 24 and a second or 
collection buffer chamber 26. A read zone 28 is located across the lanes near the second buffer 

25 chamber. Synthetic gems, such as niby. placed at each end of the read zone, can act as fluorescent 
alignment markers 30 of the read zone. 

The fluorescently tagged nucieoproteins are placed in the first buffer chamber and 
introduced at a first end 32 of the lanes. A second buffer chamber at the second end 34 of the lanes 
receives the nucieoproteins after they have passed through the lanes. A negative electrode 36 is 

30 coupled to the first buffer chamber 24 and a positive electrode 38 is coupled the second buffer 

chamber 26. A high voltage potential of e.g.. 2000 volts, is applied between the negative electrode 
and the positive electrode causing the negatively charged nucieoproteins to travel through the 

electrophoresis gel 20. 

A power supply (not shown), having a volUge range up to 4000 volts at up to 200 
35 miUiamps. supplies the high voltage potential. The electrophoresis gel 20 is typically acrylamide and 
the nucieoproteins travel through the gel at a rate proportional to the nucieoprotein s molecular 
weight or mobility. More particulariy slightly lighter and smaller nucieoproteins. have a higher 
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mobility and travel through the gel faster than longer nucieoproteins, thereby arriving at the read 
zone 28 sooner. 

In DNA sequencing, the individual nucleoproteins are DNA residual fragments 
that having differing molecular weights and mobilities based on the location of the nucleotide or base 
5 at which the DNA molecule was fragmented. Tlie exposed base of the DNA fragment is coupled to a 
fluorescent dye that Ugs the fragment in accordance with the exposed base. Accordingly, as the 
fragments of differing mobilities pass by the read zone 28 and are "read" as "identified" or taught 
herein, they may be used to identify the sequence of the original DNA molecule. 

In genotyping, DNA firagments (markers), exhibiting a distribution of sizes and 
10 lengths forming a unique signature for the DNA sample, are analyzed. These distributions, with 
each individual marker tagged, can be multiplexed and read out in each lane in a genotyping 
operational mode. 

An argon ion laser 40 illuminates the read zone 28 with excitation light 42 directed 
from the side of the electrophoresis gel 20 causing all the lanes 18 to he simultaneously illuminated 

15 by the laser's excitation light. A lens 41 placed between the laser 40 and the edge of the 

electrophoresis gel 20 improves the excitation hght 42 coupling into the gel. Further, the plates 22 
act as a light guide by internally reflecting the exciuiion light. A matching solution or buffer 43 
may be used between the air and the gel to lower reflections caused by the air-gel interface. A laser 
beam reflecting mirror 44 (FIG. 6), located opposite the laser 40, reflects the excitation light 42 

20 passing through the read zone so that the excitation light 42 makes a second pass through the same 
region to increase the intensity and uniformity of the read zone illumination. The excitation light 42 
from the argon ion laser has.two principle wavelengths at approximately 488 nanometers and 514 
nanometers. These wavelengths are advantageous for causing a wide variety of fluorescent dyes to 
fluoresce. Additional or other lasers that provide excitation light 42 having a wavelength matching 

25 the excitation spectra of the dyes may be used. 

The fluorescent light emitted by the dyes is captured by and is focused onto a liner . 
fiber-optic array 46 by a cylindrical lens 48 or by a micro-lens assembly. A flat or cylindrically 
concave mirror 50, disposed opposite the optical fiber ends, reflects fluorescent light back through 
the gel toward the optical fibers 48. The fluoresce signal is coupled through the optical fibers to an 

30 entrance slit 52 of an imaging spectrograph 54 (FIG. 1). The entrance slit 52 is filled with an 

elongated line of optical fiber ends. The light passing through the slit 52 is wavelength resolved by 
the imaging spectrograph 54 in an astigmatic manner, thereby producing a two-dimensional image at 
an image plane 56. 

As shown in Fig. 7, the first dimension 58 of the image, as discussed in more detail 
35 below, is related to the wavelength of light at the slit 52. The second dimension 60 is related to the 
distance along the entrance slit 52 at which the incoming light passes through the slit and corresponds 
to distances along the read zone 28 which crosses the lanes 18. Accordingly, the imaging 
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spectrograph produces a simultaneous spectral image of all the electrophoresis lanes 18 at its image 
plane 56. 

A cooled CCD camera 62 is oriented in the image plane 56 and detects the two- 
dimensional image. A suitable camera would be the Photometries model Sensys 1600 of Tucson. 
5 Arizona, or the Scientific Imaging Technologies model S1424A. The camera converts the two- 
dimensional image into a digital video signal which is transmitted across a daU link 64 to a daU 

acquisition computer 16. 

The CCD camera 62 has a resolution of about 1 500 pixels across the spatial 
dimension 60 of the lanes 18. Thus, each of the 384 lanes is imaged onto about 4 pixels. Preferably. 
10 the pixels are binned together in either the spacial or spectral dimension, or both, during readout to 
lower the number of bytes in the digital video signal. Thus, the imaging spectrograph 54 and camera 

62 create a hyperspectral image of the lanes. 

A hyperspectral imager (HSl) 14 is distinguished from a multi-color spectrometer 
system which typically has several discrete spectral channels with broad-band properties (i.e.. several 
15 tens of nanometers in bandwidth). A typical hyperspectral imager has continuous channels less than 
5 nanometers wide. In the spectral dimension, the hyperspectral imager has a spectral resolution of 
about 3-4 nanometers, contiguously spaced between about 450 nanometers and 900 nanometers. 
Because the imaging spectrograph 54 provides a hyperspectral image for wavelengths between about 
450 and 900 nanometers, more of the emitted fluoresce signal is collected and used in idemifying the 
20 fluorescent dyes. This is particularly valuable since the emission spectra of the dyes overlap. 

FIG. 2 compares, at one wavelength, the signal (trace B) received from one dye in 
a lane of an existing system, and the signal (trace A) for the fluoresce dye received from a lane hy the 
system of the present invention. The genetic sequence is of the bacteriophage M 13mpl 8 taken with a 
monochrometer set at the wavelength of 520 nm. which corresponds to the fluorescence of the dye 
25 associated with the base cytosine (C). The other three bases are not marked with a dye. The sample 
size used in both traces is the same. Using the entire spectral content would result in even a higher 
signal for the system of the present invention- 
More particularly, as shown in FIGS. 3A and 3B, the spectral signal of common 
dyes encompass a very wide spectral band. Existing systems, however, merely look at a narrow 
30 window 66 (FIG. 3B) centered about the peak of a dye's fluoresce spectra to identify the particular 
fluoresce dye. Accordingly, the fluoresce signal falling in the spectra 68 outside the narrow window 
is ignored by existing systems and the energy within the envelope of the profile of the fluoresce 

signal is lost and not used. 

As discussed in more detail below, the processor 16 of the present invention 
35 compares the spectral profile of the received fluoresce signal with a reference profile for each of the 
dyes in use. A weighting factor between the spectral shape of the received signal and the reference 
spectra resulte in a more accurate determination of the dye associated with the fragment. 
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An alternative embodiment of the optical system 10' is shown in FIG. 4. In this 
embodiment, the excitation light 42' from the argon ion laser 40 is spread by an aspheric cyiindrical 
lens set 74 into a beam fan 70 that simultaneously illuminates the read zone 2B. A first turning 
mirror 76 and an electro-optical shutter 78 assist in controlling and directing the excitation light. A 
5 camera of the imaging device 14' has optics 72 that directly image the linear read zone 28. through a 
second turning mirror 80 and a slit 52', and spectrally disburses the fluoresce light to similarly 
produce a two-dimensional image. No fiber-optic bundle is used in this embodiment because the 
read zone 28 is directly imaged onto the image plane. A cylindrical lens 48' may be used just over 
the read zone 28. A thermal jacket or heater 8 1 may be coupled to the electrophoresis gel support to 

10 stabilize the gel temperature and thus, the lane uniformity. Like the previous embodiment, one 

dimension 58 of the two-dimensional image corresponds to a distance along the read zone 28, and the 
other dimension 60 corresponds to the wavelength range between about 450 and 900 nanometers. 

Referring again to FIG. 5, each lane 18 has an electrostatic loading pad 82. The 
loading pads 82 can have a dimension as small as 10 microns square. A sample is concentrated onto 

15 the pad 82 by placing a small sample onto or near the pad 82 and applying a negative voltage 

potential, about 1 volt, to the pad 82 that attracts the negatively charged fragments to the pad 82. 
After the charged fragments are loaded in the lane, a small voltage potential, on the order of 0. 1 to 
0.2 volts, is applied to the loading pad 82 while the loading process is repeated for the remaining 
lanes 18. The voltage potential at the electrostatic loading pad 82 is then removed, and a positive 1 

20 volt potential is applied, causing the charged molecules to leave the pad and enter the lane 18. The 
fragments are then run down along the lanes by applying a large negative voltage potential between 
the negative electrode 36 and the positive electrode 38. A sample as small as 14 picoliters can be 
placed in a lane 18 by pipetting the DNA sample onto or near a loading pad 82 and performing the 
loading process. 

25 Alternatively, a mechanical gel loading system (not shown) can precisely pipette 

samples to a gel *comb' region of the gel. The gel comb can be either a traditional comb or the 
electrostatic loading pads or concentrators discussed above. The mechanical system can be coupled 
to the sequencing/genotyping system 10 using a nuLogic 3-axis controller and a National Instruments 
I/O system that can control pipetting and access alignment electronics having IR sensors. Such a 

30 system can be mounted above the gel 20, in a region arranged to be near ground potential, and can 
transfer liquids from a chilled microwell plate (to minimize evaporation losses). The picopipeUor 
can utilize teflon microtube fibers reinforced with ceramic insulators. When used in conjunction 
with electrostatic loading pads 82 discussed above, the samples can be synchronized if necessary. If 
alternating lanes 18 loaded and started, and then the remaining lanes are loaded and started, finding 

35 the lanes is easier for the processor 16. The electrostatic loading pads 82 can gather charged DNA 
from the loading volume and hold it in the vicinity of a coated metalized or other conducting 
electrode (preferably indium tin oxide, gold, or platinum). 
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The electrostatic loading may be performed on or off the sequencer 10. If the 
loading is performed off the sequencer 10, then in the electrophoresis device 12 may be transported 
with the samples atuched, installed in the sequencer, and the samples released. In either case» the 
very small pads may be loaded from a much larger volume (microliters), and the samples may be 
5 concentrated and spacially located, thus narrowing the lanes and allowing more samples to be loaded 
across a given gel width. Electrostatic loading reduces the precision required to spacially place a 
sample volume in the lane, because precision is gained by the concentration of the sample at the pad 
82. Accordingly, the sequencer/genotyper 10 of the present invention can sequence with DNA 
samples as small as 140 nanograms or with cosmid samples as small as 12 nanograms. 
10 With reference to FIG. 7, it is seen the imaging spectrograph produces a series of 

frames as a function of the fragment length. Each frame has one dimension corresponding to the 384 
electrophoresis lanes, and another dimension corresponding to the wavelength spectra, between about 
450 to 900 nanometers. The timing between frames is set to gather fluorescent light from the 
fragment groups as they pass through the read zone. A frame rate of I to 2 frames per second is 
15 sufficiently rapid to image the fragments as they pass through the read zone. If the fluorescent light 
from a particular fragment group is included in a more than one frame, then the signal for that 
fragment group from successive frames can be summed together. 

As shown in FIG. 8A-8C, the processor 16 removes background information by 
taking a signal from the lanes 18 and subtracting a background scan to produce an enhanced scan. 
20 FIG. 8A is an image of a "raw" data frame that includes the fluorescent light from a lane along with 
"background noise," The fluorescrat light is produced by dyes associated with dye primer 
chemistry, discussed above. FIG. 8B is an image of the lane before the fluorescent dyes enter the 
read zone, indicating the background noise. The fixed pattern of the background noise in the CCD 
camera 62 is determined from field flats and the dark current from dark frames (FIG. 8B). The 
25 background noise is processed out of the raw data (FIG. 8A) so that only the dye's spectral signature 
remains (FIG. 8C). However, noise components produced in the statistical "Poisson" detection 
process cannot be eliminated. 

As shown in FIGS. 9A, the imaging spectrograph 10 of the present invention 
produces an image of the entire emission spectra of a fluorescent dye. The emitted fluorescent light 
30 is produced by dyes associated with dye primer chemistry. As discussed in more detail below, the 
processor correlates the measured spectra with reference spectral profiles, and based on the 
correspondence of all the dye's fluoresced light, calculates a weighting factor. Based on the 
weighting factor, the processor is able to identify the particular fluoresce dyes, and then identify the 
corresponding base. In FIG. 9B, the graph shows the integrated light intensity falling in the spectra 
35 ranges corresponding the dyes of the indicated bases. 

As shown in FIG. 10, the time dependent emission of the four dyes is shown in 
four traces. A trace file consists of data for the four traces. These dyes are associated with dye 
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terminator chemistry and are also available from ABI. Each trace represents a wavelength band 
corresponding to the respective dyes. The trace baselines are shifted to separate the curves for 
improved viewing. The top curve 84 is produced by the FAM dye terminator and is associated with 
the G base. The next curve 85 is produced by the TAM ARA terminator dye and is associated with 
5 the T base. The next curve 86 is produced by the R6G terminator dye and is associated with the A 
base. The bottom curve 87 is produced by the ROX dye and is associated with the C base. The 
processing system 16, because it has the entire spectral information, is able to resolve the presence of 
any or all of the dyes passing the read zone 28 in any given lane 18 even if the dyes arrive at the read 
zone simultaneously. Accordingly, the present invention allows the use of multiple dyes, or dye 
10 multiplexing, to enhance the performance of the sequencer or genoiyper 10. From this "trace" 
information the base letters can be assigned or called using the image processing software. By 
comparing each dye emission in the four traces by size, peak width, peak position, relative to other 
peaks, the sequence of letters is determined. 

An important feature of the invention is that new dyes may be used for multi-color 
15 lagging of base pairs, using six to eight, or more, fluorophores. TTiis is particularly of value for 

fragments marked for genotyping (DNA finger printing) where many fragments need to be measured 
simultaneously. The spectra of each specific fluorophore is stored in the image processing software. 
These known spectra permit spectral unmixing of detected multiple fluorophones, and by use of 
correlation (deconvolution) techniques, utilize the fiill energy in each dye spectra to enhance the 
20 base-calling detection and accuracy. The ability to more accurately resolve dyes with similar 

emission signatures (i.e. close peak wavelenths) using the entire emission spectra deconvolution 
expands the number of possible dyes used in a single run beyond those in current use. In particular, 
a new set of SYBR dimeric cyanine or Bodipy dyes, commercially available from Molecular Probes, 
Inc. of Eugene, Oregon are excitable with the 488 nm argon ion laser line, have potential as 
25 candidates for multiplexing and sequencing dye sets. The conunercially available dye set has 8 

members with peak emission wavelengths that span from about 460 to about 670 nanometers, within 
the identification range for the imaging spectrograph. 

Further, by using eight fluorescent dyes, the dyes can be alternated so that adjacent 
lanes do not use the same dyes. Alternatively, a fifth dye can be added to every second lane for use 
30 in lane tracking. 

Additionally, dyes that produce infrared emission light having a wavelength 
including 1300 mn may be used. These dyes would be excited with, for example, infrared diode 
lasers. Also, the spectral range of the imaging spectrograph would be shifted or extended to be 
responsive to the infrared emission light. 
35 FIG. 1 1 shows a time-trace of the fluorescence emission spectra for the four dyes. 

The traces are based on the intensity of light within the several pixels corresponding to the 
wavelengths associated with each base*s dye< The baseline for each trace is arbitrarily shifted to 



wo 98/08085 



PCT/US97/14642 



-13- 



better view the curve for each base. By comparing the magnitudes of the traces it is possible to 
identify or call the sequence of the DNA sample by delennining the dye moving past the optics. 

The spectral profile matching and data analysis is performed using a process shown 
in the flow diagram of FIG. 12. In the first step 90, the image that is captured in adjacent pixels are 
5 binned together. The CCD data acquisition software continuously dovmloads data from a CCD 
camera onboard buffer to the computer memory. The camera acquires from I to 2 frames a second. 
Each frame consists of 524.288 numerical values using a 12-bit pixel resolution. The dau may be 
transmitted via a SCSI port or a PCI bus directly to the computer s disk. 

In the next step 92, the dato from the CCD is to the processor 16 where, in the 
10 next step 94, the background dato is removed. In the next step 96, spectral matching with the stored 
library of fluorophore spectra (signature) is performed in order to generate trace' data (or files). 
The spectral matching to the first fluorophore is performed and then that information is amplitude 
subtracted from the raw daU. The residual data is then matched to the signature of the second 
fluorophore, noted, subtracted until all the fluorophore spectra have been removed and the residual is 
15 minimized. The simple linear mixing/un-mixing produces an equivalent of the standard four color 
trace file, but it represents a more advantageous method by operating on significantly more spectral 
data (instead of just the output from a set of 4 filters or spectral windows) . The method of the 
present invention yields significantly more sensitivity. The spectral unmixing detection algorithm, to 
determine the mix of fiuorophofiss in a pixel, is advanUgeous to remove spill-over' or leakage of 
20 signal from one channel' to another, which is typical on current machines especially since the broad 
emission spectra of the dyes overlap. 

In the next step, 98, each lane is tracked and extracted from the image. Because of 
transverse mobility nonuniformities. the lanes (migrating samples) slowly move horizontolly or 
laterally in ume. Each lane must therefore be followed or "tracked" across the image plane. Each 
25 tracked lane is extracted from the rest to give a "trace" that represents a single sample (step 100). 

The optical signal for each trace is analyzed and based on a trace generation, in step 102. the base is 
called. An ASCII file is generated having the base sequence in step 104. 

Alternatively, in step 106, the gel image can be constructed from the hyperspectral 
images. The gel image is the image cube where the depth in the wavelength dimension has been 
30 reduced to four, corresponding to the peak emission of each dye. The gel image can then be 
analyzed using conventional tracking and base-calling software (step 108). 

An added feature of the present invention relates to the use of glass plates having 
linear surface profiles to help eUminate lane distortion or tracking errors during electrophoresis. One 
such plate- constniction approach is to use a niUng engine similar to those used to make diffraction 
35 gratings to modify the gel plates. Such rulings would be very course by optical standards (less than 
10 lines per millimeter). The ruled slab can be used as a ma.ster. for replicating the gratings to the 
desired optical specification of flatness and surface quality. The attractive features of capillary 
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electropboresis can thus be realized in a slab gel without the attendant problems of cleaning and 
reusing capillary glass components. 

The wavelengths (488 nm and 5 14 nm) of the excitation light 42 from the argon ion 
laser 40 falls within the spectra range of the imaging spectrograph 54. Because the CCD camera 62 
5 has a large dynamic range, the signal from the pixels associated with these wavelengths collect 

excitation light scattered from the gel 20. The scattered light may be monitored to track and calibrate 
the laser's optical power level. Further, because the fluorescence signal from the dyes is matched 
using decon volution, the intensity spikes from the scattered exciution light do not interfere with 
calculating the weighting factors, thus avoiding the use of excitation light filters and external 
10 excitation light power monitors. 

OPTICAL PERFORMANrF 

As presently understood, a sequencing run on each of the 384 lanes of the slab gel 
requires about 12 nanograms of single stranded DNA template (MI3 insert plus vector of -8.2 kb). 

15 That sample size of 4.3x10 '^ mol or about 2.6x10^ molecules of DNA provides a fluoresce signal 
that is roughly equal to the signal from background fluorescence and scattering. For identifying 
cosmids, this corresponds to 1440 nanograms of template DNA. For DNA sequencing, a typical 
DNA cycle sequencing reaction produces 1000 fragment lengths in the sequencing ladder. Fifteen 
rounds of extension at 80% efficiency are performed and each fragment is a terminated fluorochrome 

20 labeled roolecule(pnmer or terminator). Accordingly, about 3.1x10' (2.6x0. 10^x0.8x15/1000) of 

each fragment length is in each lane. An assumption that all fragment lengths are produced equally is 
true to about a factor of 2 to 4. 

For a weak fluorophore, photo-bleaching occurs after about 3200 cycles. This will 
require 1.42x10" photons (3. 1x10^ * 3200/0.7) at 70% quantum efficiency (Q.E.) in the read 

25 volume of each lane. At an excitation wavelength of 488 nanometers, a photon has an energy equal 
to hv, or 4x10 '^ joules/photon. The total energy required to photo-bleach the weak fluorophore is 
6x10 ' joules of excitation light at a wavelength of 488nm. This energy level establishes the 
maximum useful level of excitation light energy per iane. More specifically. 2.3xlO*\joules (384 * 
6x10'*) per read event for all 384 traces is the maximum excitation energy. If the gel is read 6 times 

30 a second, the total exciution energy required for each read is 1.4x10 * watts at a wavelength of 
488nm. A laser having an optical power level in the milliwatt range, to allow for inefficiencies, 
should thus be used. 

The integrated fluorescence and Raman scatter from the gel /buffer are similar in 
magnitude. The gel background fluorescence is about equivalent to 2x10 '^ mol of fluorescein when 

35 excited at 488nm. Accordingly, the gel/buffer produces about 8x10^ photons/steradian (sr) 

(0.7*1.42x10" /47C) for a single read of a lane. In the apparatus of the present invention, the total 
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Raman and gel fluorescence background nuliance. U. has been found to be about twice this value or 
1.6x10" photons/sr. 

The fluorescent signal level can be calculated as the number of incident photons 
tin«s the quantum efficiency of the dye. Dye quantum efficiencies vary widely. Ethidium bromide 
5 hasaQ.E. of about 15% while SYBR Green I hasaQ.E. of about 80%. An intermediate Q.E. 
value of 50% will be used for this calculation. Accordingly, a typical signal level will be 7. ixlO'» 
photons (0.5*1.42)il0")into 4ii steradian, or a signal radiance. L.. of 7.15xl0"'/47t = 5.65xl0' 
photons/sr based on the maximum excitation level established above for photo-bleaching. 

Therefore, the contribution of the background fluorescence and Raman scattering 
10 (8x10' photons/sr) is approximately equal to the signal from the fluorescent dyes (5.65x10' 

photons/sr). This background level, in conjunction with a camera that has 12 to 14 bit resolution 
(2048 to 16.384 counts of intensity resolution) gives more than an adequate signal for deconvolution. 
peak fmding. and ultimately, base calling. 

15 ^yr.NiAl -TO-NOISH RATIO 

The calculation of the SNR of the optics and associated systems demonstrates that 
the signal detection noise is small. TTie signal detection noise is different from the background noise 
because the background noise can be subtracted away, but the signal detection noise appears as an 
undesirable signal component inherent in all detection devices. 
20 The SNR for each lane of the slab gel may he written as: 

' SNR=N./(SN„')"' 

where N. is the signal count and (SO « the sum of the square of the noise counts. For the SNR 

Calculation, the following defmitions apply: 

a = solid angle subtended by the collecting optics (steradian) 
25 t = transmission of optics (fraction) 

C = quantum efficiency of detector (electrons/photon) 

T = integration time on the CCD detector 

n = number of spectral pixels per lane on the CCD 

Accordingly N, is as follows: 
30 N.=£lt5L,/n electrons/pixel 

andLN%=nN^»' + N,k+nN^'-i-N. 

where: 

N. = the signal count, defined above (electrons) 
Njk = pixel dark noise (RMS electrons) 
35 Nrt =• pi*el (RMS electrons) 

Ntt = background count = £1 1 5 W (electrons) 
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For the signal and background counts, Ihe root of the mean value is taken to represent the variance of 
a normal distribution, and hence the noise. 

As discussed above, a suitable scientific CCD detector for use with the invention is 
available from Scientific Imaging Technologies, Inc., model S1424A, as a 2048 x 2048 camera. The 
5 camera's detector plane has 2048 pixels oriented to the spatial direction, and 2048 pixels to cover the 
spectral dimension. The pixels are 24/t x 24/i square and are binned in the spectral dimens 58 to 
better match the point spread function of the spectrograph. Hence, the spectra are sampled into 256 
pixels, these pixels containing all the detected photo-electrons emitted by the fluorophore's light. 
The spatial dimension 60 has 2048 elements to resolve the 384 lanes. Of course, the lanes can 
10 always be widen to, for example, 96 lanes across the gel, and proportionately more pixels would 
cover each lane. 

The pixel read noise is an inverse function of the readout rate (IMHz). With the 
pixels binned 8x on the CCD chip of the camera 62, there are 5.2x10*^ values to he read. 
Accordingly, the value of is: 
15 = 5 rms electrons 

A typical dark current at -30oC is le-/sec/pixel. With a binning of 8 pixels and 
sampling of the read area with l/x frames per second, then: 

Ndk = 8*T (electrons) 

For this calculation, x was .17 seconds, the optical efficiency, t, for the camera 62 was 0,3, and the 
20 quantum efficiency » ^, was 0.45. 

With the S1424A CCD chip, the spatial dimension 60 of the image plane 56 is 

nearly 49 mm square. The imaging spectrograph 54 is designed with a focal image plane 56 

sufficiently large to accommodate the large focal plane of the CCD chip with a focal ratio of f/4.0. 

The instrumental magnification is unity, and the gel slab 20 has 384 lanes 18 in a total width of 150 
25 nun resulting in the magnification onto the slit being 48/150=0.32. The field-of-view of the 

spectrograph 54 is only 0.2 radians. The derived solid angle Q is: Q«(Tc37.5V4)/750- = 1.96 x 10'^ 

steradians, where 750 millimeters is the distance between the spectrograph's fore-optics and the gel 

surface. 

30 SINGLE PIXEL SNR 

The SNR is calculated using the above assumptions. Before performing spectral 
correlations with the known emission spectral signatures of the fluorescent markers, it is 
advantageous to have an adequate SNR at the pixel level of the CCD camera 62. To determine 

the available SNR at the pixel level, a worst case scenario is assumed where the emission spectra to 
35 be detected is uniformly distributed across all 256 continuous and contiguous spectral binned pixels, 
and then solve for the SNR. 

SNR.. ^ . . = 7.5=17.5dB 
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A SNR of 17.5dB is sufficiently robust to allow a high detection probability. Further, the CCD 
camera 62 has sufficient dynamic range because the calculated signal counts are 5.8x10' electrons and 
the well pixel (or CCD) capacity of the CCD chip is 2.0x10* electrons. 

Two assumptions are made in the above analysis. The first assumption involves 
5 the range of fragment lengths of single stranded DNA. Each length was assumed to be produced in 
equal numbers by the endonuclease restricUon processes. Actually, smaller strands are sUtistically 
more likely to be produced than the longer fragments. This assumption over-predicts the SNR 
achievable on the longer fragments. 

The second assumption is that the fluorescent spectrum of the fluorophores is 
10 uniform across the spectral response of the imaging spectrograph 54. The .spectral features have a 

wide dynamic range and normally fail to utilize the entire spectral range of the instrument. Hence, 
the single pixel SNR is a spectral variable of wide dynamic range that is greatly underestimated. 

The frame rate required of the CCD camera 62 is determined by the rate of 
migration of the fragments across the read zone 28 of the gel 20. Frame rates of the present 
15 embodiment are less than 6 per second and can be much lower. The CCD's frame rate determines 
the laser power required. If the CCD frame rate is low. then the excitation energy can be applied 
over a longer period of time for a lower average laser power. One skilled n the art would recognize 
that, in practicing the present invention, a number of trade offs exist between data rate, read noise, 
dark noise, data processing speeds and data storage. 



20 



SPPrTRAI . C QTtPPT ATION 

The correlation is performed in software, not on the CCD chip. Stored templates 
of all known fluorophores are conqjared against each unknown measured complete spectra. This 
allows wide versatility in the selecUon and number of dyes used in any nin. The calculated values 
25 for a perfect correlation yields a SNR of 1.923. or 65dB. This value is found by substituting unity 
for the number, of pixels, n. in the numerator in the signal count equations discussed above. With 
this performance, the probability of detecting and correctly identifying a fluorescent marker is 
essentially unity and the probability of an error is very small. 

If more than one fluorophore terminator is present during the CCD signal 
30 integration time, these can be spectrally un-mixed using software that uses the limited number of 

spectra used in the experiment. This library of possible spectra is referred to in hyper-spectral image 
processing as "end-members". End-members are used linearly to determine the fraction of each 
which yields the minimum "residual" - the portion of the signal not attributed to a specific spectnim. 
usually noise. Hence, with spectral pixel un-mixing, more than one fluorophore may be detected and 
35 read during a single read. 

The unmixing is solved using the equation: 
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where: 

N == number of dyes, f^C^) = Ihe spectral emission of the dyes, = the spectral contribution of the 
dye or "weighting* factor and is found by deconvolution of 1 (X2<^ < A,)(A.2==450nm, Xj = 900nni). 

As is shown above, the instrumentation noise level is orders of magnitude smaller 
5 than the "biologicaP noise level for the sequencing sample. The typical amount of biological noise 
(sometimes called background by sequencing technicians) in the traces for existing machines varies 
greatly depending on the success of the cycle-sequencing reaction, which in turn depends on the 
sequencing target, and the quality and quantity of the target DNA preparation. In a sequencing 
cycle, the sample or biological noise can vary from about a few percent to ^10% and still the base- 
10 calling software of the invention performs well. This level is significantly above the 0.05 % 
contribution by the instrument (i.e.6Sdb SNR). 

Advantageously, a DNA sequencer of the present invention has the capability of 
sequencing 76,800 base pairs per hour. This rate is 5 to 30 times faster than existing DNA 
sequencers. 

15 While the invention herein disclosed has been described by means of specific 

embodiment and applications thereof, numerous modification and variations could be made thereto 
by those skilled in the art without departing from the scope of the invention set forth in the claims. 



wo 98/08085 



PCT/US97/14642 



-19- 

What is claimed is: 

1 . Apparatus for detecting charged molecules having differing mobilities and chemical 

5 properties and having fluorescent tags, each tag corresponding to a particular chemical property of 

the molecules, comprising: 

a plurality of electrophoresis lanes, each lane having a respective first and second 
end, and an electrophoresis medium between the first and second end of each lane, wherein an 
electrical potential, of appropriate polarity, is applied between the first and second end causes the 
10 charged molecules, applied at the first end, to travel toward the second end at a rate proportional to 
the molecule's mobility such that the charged molecules are separated along the lane based on the 

molecule's mobility; 

a read zone that extends substantially along an image line that intersects the 

plurality of electrophoresis lanes near the second ends; 

15 a light source that illuminates the read zone with excitation light to cause the 

charged molecules to fluoresce producing fluorescent light; and 

an imaging spectrometer that spectrally images the fluorescent light onto a two- 
dimensional imaging plane, the first dimension being associated with a distance along the image line 
of the read zone and the second dimension being associated with the wavelength of the fluorescent 

20 light. 

2. Apparatus for detecting fluorescent molecules as defined in Claim 1, further 
comprising a camera having a two dimensional pixel array, the camera generating video signals based 
on the intensity of light incident upon the pixel array. 

25 

3. Apparatus for detecting fluorescent molecules as defined in Claim 1 , wherein: 

the imaging spectrograph further comprises a linear entrance aperture with discrete 
locations along the aperture corresponding to locations along the first dimension of the image plane; 
and 

30 the apparanas fixrther comprises a plurality of optical fibers that couple the 

fluorescent light from the read zone to corresponding locations along the entrance aperture. 

4. Apparatus for detecting fluorescent molecules as defined in Claim 1» wherein the 
imaging spectrograph includes an optical lens system that directly images the read zone. 



35 



5. Apparatus for detecting fluorescent molecules as defined in Claim 1, further 
comprising a processor that compares the detected fluorescent light received from molecules of a 
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given mobility with reference spectral profiles for the fluorescent lags, to identify the associated 
fluorescent tag and thus the molecule's associated chemical property. 

6. Apparatus for detecting fluorescent molecules as defined in Claim 5, wherein: 
5 the charged molecules are at least 10 differing genetic markers of a genome; 

a fluorescent tag, having a unique spectral profile, is associated with each genetic 

marker; and 

the processor compares the detected fluorescent light with the reference spectral 
profiles by calculating weighting factors for each reference spectral profile based on a deconvolution 
10 of the detected fluorescent light with the reference spectral profile for each of the fluorescent lags. 

7. Apparatus for detecting fluorescent molecules as defined in Claim 5, wherein: 

at least one fluorescent dye is attached to charged molecules of a known mobility, 
to provide a mobility calibration reference; and 
15 the processor calculates the mobility of any charged molecules of unknown 

mobility, based on the unknown molecule's travel rate and the mobility calibration reference. 

8. Apparatus for detecting fluorescent molecules in Claim 4» wherein: 
the fluorescent tags used in adjacent lanes differ from each other; and 

20 the processor tracks lane drift along the image line using the difference between the 

fluorescent tags in adjacent lanes. 

9. Apparatus for detecting fluorescent molecules as defined in Claim 1 , further 
comprising a display that displays a graph of the chemical property of the molecules crossing the 

25 imaging line verses time. 

10. Apparatus for detecting fluorescent molecules as defined in Claim 1 , wherein the 
charged molecules are DNA fragments, each fragment being tagged with an atUched fluorescent dye 
that identifies the fragment's end base. 



30 



11, Apparatus for detecting fluorescent molecules as defined in Claim 1 » wherein the 
fluorescent Ugs are four primer dyes, FITC, TRITC, NBD-fluoride, and Texas Red, corresponding 
to the bases C, G, A, and T, respectively. 



35 



12. Apparatus for detecting fluorescent molecules as defined in Claim 1, wherein the 
light source generates excitation light at wavelengths of about 488 nanometers and 514 nanometers. 
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the lanes are on a planar substrate; 

the excitation light is a laser beam that travels along the image line to 
simultaneously illuminate the lanes. 

5 

14. Apparatus for detecting fluorescent molecules as defmed in Claim 13, further 
comprising a mirror that reflects the laser beam back through the read zone along the image line to 
increase the int^sity and unifortnity of the fluorescent light. 

10 15. Apparatus for detecting fluorescent molecules as defined in Claim 13, further 

comprising a lens that couples the laser beam from a laser to the gel to increase the intensity of the 
excitation light traveling along the image line. 

16. Apparatus for detecting fluorescent molecules as defmed in Claim 1, further 
15 comprising: 

optical fibers for coupling the fluorescent light from the image line to an entrance 
slit on the imaging spectrometer; 

a cylindrical lens that focuses the fluorescent light from the imaging line onto ends 
of the optical fibers; and 

20 a mirror located behind the read zone such that fluorescent light travelling away 

from the fiber end is reflected back toward the optical fiber ends. 

17. Apparatus for detecting fluorescent molecules as defined in Claim 1 , wherein the 
second dimension of the imaging plane corresponds to the spectral range from about 450 nanometers 

25 to about 900 nanometers. 

18. Apparatus for detecting fluorescent molecules as defmed in Claim 1. wherein: 

the excitation light is narrowband light having a wavelength within a spectral range 
of the imaging spectrometer such that exciUtion light scattered by the electrophoresis gel is imaged 
30 on a pixel array on the imaging plane; 

the electrical signals, generated by the pixel array based on the intensity of light 
collected by pixels of the pixel array corresponding to the wavelength of the excitotion light, provide 
a monitor of the excitation light's intensity. 

35 19. Apparatus for detecting fluorescent molecules as defined in Claim 1, further 

comprising a buffer to optically couple the light from the light source to electrophoresis medium. 
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20. Apparatus for detecting fluorescent molecules as defined in Claim 1 , further 
comprising first and second electrodes for applying the electrical potential to the lanes at the first and 
second ends, respectively, and extending across all of the plurality of lanes. 

5 21 . Apparatus for detecting fluorescent molecules as defmed in Claim 1 , further 

comprising a plurality of loading electrodes situated near the first electrode, each loading electrode 
associated with a lane for loading the charged molecules into the lanes. 

22. Apparatus for detecting fluorescent molecules as defined in Claim 1 , wherein the 
10 electrophoresis medium is a gel having a thickness of 200 microns which is sandwiched between two 

glass plates. 

23. Apparatus for detecting fluorescent molecules as defined in Claim 1, wherein the 
read zone farther includes fluorescent markers at each end of the image line for indicating the read 

15 zone. 

24. Apparatus for detecting fluorescent molecules as defined in Claim 1, wherein the 
plurality of electrophoresis lanes comprise 384 separate parallel lanes. 

20 25. Apparatus for detecting fluorescent molecules as defined in Claim 1 , wherein the 

imaging spectrometer simultaneously images the fluorescent light onto the two- dimensional imaging 
plane without any scanning motions or delays. 

26. Apparatus for detecting fluorescent molecules as defined in Claim 1 , wherein the 
25 plurality of electrophoresis lanes comprise a capillary electrophoresis array. 

27. Apparatus for detecting fluorescent molecules as defined in Claim 1, further 
comprising a heater that is thermally coupled to the electrophoresis medium for mainuining the 
medium within a predetennined temperature range. 

30 

28. A method for sequencing DNA, comprising: 

producing DNA fragments that are lagged with fluorescent dyes» the dyes 
indicating an end base associated with the respective DNA fragment; 

separating the DNA fragments according to mobility using electrophoresis of the 
35 fragments on a plurality of electrophoresis lanes* the separated DNA fragments forming fragment 
groups of slightly different mobility; 
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exciting the fragment groups with excitation light to cause the fluorescent tags to 

fluoresce; 

forming a series of hyperspectral images of the separated DN A fragments, the 
hyperspectral images simultaneously covering all of the electrophoresis lanes and a broad spectral 
5 range; and 

identifying the fluorescent dye associated with DNA fragments of particular 
molecular weight by fitting the spectra emitted by a fragment group with reference spectra associated 
with the fluorescent dyes. 

10 29. A method of identifying molecules tagged with fluorescent dyes, comprising; 

forming a hyperspectral profile of the molecules* fluorescence emission, the 
hyperspectral profile covering a wavelength range from 450 nanometers to 900 nanometers with a 
resolution of less than 5 nanometers; 

convolutxng, for each dye. the hyperspectral profile with a reference spectral profile 
15 of each dye over the entire spectral range to generate a weighting factor for each dye; 

indicating, based on the correlation factor, the contribution of a particular dye to 
the molecules' fluorescence emission. 

30. A method for loading charged molecules of differing mobilities into a plurality of 
20 closely spaced electrophoresis lanes, each lane having first electrophoresis electrodes located at a first 
end of the lane, a second electrophoresis electrodes located at a second end of the lane, and a loading 
electrode located near the lane's first end, comprising: 

applying solution having the charged molecules over the loading electrode of the 

first lane; 

25 applying a first voltage to the first and second electrophoresis electrodes of the first 

lanes, the first voltage having the same polarity as the charged molecules; 

loading the charged molecules into the first lane by applying a large second voltage, 
having a polarity opposite the polarity of the charged molecules, to the loading electrode of the first 
lane; 

30 while applying a third voltage to the loading electrode of previously loaded lanes, 

the third voltage being a reduced value of the first voltage, repeating the loading process for the 
remaining electrophoresis lanes; 

after loading the lanes, applying a fourth voltage between the first electrophoresis 
electrode and the second electrophoresis electrode to cause the charged molecules to migrate through 

35 the electrophoresis lane at a rate proportional to the molecules mobility. 
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31. A method of loading electrophoresis lanes as defined in Claim 24, wherein: 
the charged molecules are negatively charged DNA fragments; 

the first voltage is between about -1 and -2 volts; 
the second voltage is between about + 1 and +2 volts; 
5 the third voltage is between about -0. 1 and -0.2 volts; and 

the fourth voltage is between about -2000 and -4000 volts. 

32. A method of identifying fluorescently-tagged DNA base fragment groups, 
comprising; 

10 exciting the fragment groups with excitation light to cause a fluorescent tag 

associated with a respective fragment group to fluoresce and produce fluorescent light; 

detecting the fluorescent light emitted by each fragment group over a sufficiently 
large wavel^gth range to produce a spectral profile of the fluorescent light from each fragment 
group; 

15 comparing the detected spectral profile from each fragment group with reference 

spectral profiles associated with the fluorescent tags to generate a weighting factor; 

identifying the fluorescent tags associated with each fragment group based on the 
fragment group's weighting factor. 

20 33. A DNA base identification method as defined by claim 32, wherein the wavelength 

range extends from about 450 nanometers to about 900 nanometers. 

34. A method for identifying molecular fragments of differing mobilities, comprising: 
tagging the molecular fragments with various fluorescent dyes so as to associate 
25 differing dyes with molecular fragments having differing values of a charcleristic; 

sq>arating the molecular fragments according to mobility to form fragment groups 
of slightly different mobilities; 

exciting the separated fragment groups with exciution light to cause the fluorescent 

tags to fluoresce; 

30 forming hyperspectral images of the separated fragments groups, the hyperspectral 

images simultaneously covering a broad spectral range; and 

identifying the fluorescent dyes by fitting the spectra emitted by a fragment group 
with reference spectra associated with the fluorescent dyes. 



35 



35. A method for identifying molecular fragments as defmed in claim 34, wherein the 
hyperspectral image covers a wavelength range associated with the fluorescent dyes with a resolution 
of less than 5 nanometers. 
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36. A method for identifying molecular fragments as defined in claim 34. wherein the 
molecular fragments are DNA fragments and the separation of the fragment groups identifies the 
DNA's sequence. 

5 37. A method for identifying molecular fragments as defined in claim 34, wherein the 

molecular fragments are DNA fragments separated according to the DNA fragment's mobility using 
an electrophoresis gel. 



10 



38, A method for identifying molecular fragments as defined in claim 34, wherein the 
electrophoresis gel is configured in a capillary electrophoresis array. 
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